Using Spot Instance SLAs for Reliable Cloud Federation
Authors
Abstract
Spot instances are commonly offered by IaaS cloud providers to opportunistically utilize spare capacity and meet temporary user demand for additional resources at low cost. Although the availability of service SLAs is a core paradigm of cloud computing, spot instances typically come without any service quality guarantees. We aim to extend the spot instance service to provide SLAs for eviction probability, based on the user's estimate of the maximum expected instance lifetime. In addition to giving users better usability and ahead-of-time quality-of-service guarantees, this statistical certainty also opens the door to cloud-to-cloud federation of workloads. For this federation to be possible, however, the statistical guarantees must be adhered to strictly, for a wide range of real-world workloads, at cloud scale. To this end, we propose a new approach to providing SLAs on the time-until-eviction for spot instances. We employ Monte-Carlo simulation to compute the quantiles of the conditional distributions of future spot instances for different available capacity levels. An IaaS cloud scheduler then uses these quantiles to determine when to provision federated requests in order to maintain an SLA at a specific target eviction probability for spot instances. We investigate the reliability of such SLA enforcement using synthetic and real-world traces, test its viability for cloud-to-cloud workload federation, and provide an in-depth analysis of the trade-offs and cost factors of such federation.
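The quantile-based admission idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not specify the demand model, so the sketch assumes a simple random walk for free capacity, and the names `min_free_capacity_quantile` and `can_admit` are hypothetical. It simulates many future capacity paths over the requested lifetime, takes a low quantile of the minimum free capacity seen on each path, and admits a spot request only if it fits under that quantile.

```python
import random


def min_free_capacity_quantile(free_now, horizon, eviction_prob,
                               n_paths=2000, step=1, seed=0):
    """Estimate a quantile of the minimum free capacity over `horizon` steps.

    ASSUMPTION: free capacity follows a symmetric random walk clamped at 0.
    A real scheduler would replace this with a model fitted to its own traces.
    """
    rng = random.Random(seed)
    minima = []
    for _ in range(n_paths):
        free = free_now
        lowest = free
        for _ in range(horizon):
            # Hypothetical demand model: capacity moves by -step, 0, or +step.
            free = max(0, free + rng.choice((-step, 0, step)))
            lowest = min(lowest, free)
        minima.append(lowest)
    minima.sort()
    # At most a fraction `eviction_prob` of simulated paths dip strictly
    # below the value at this index.
    idx = min(int(eviction_prob * n_paths), n_paths - 1)
    return minima[idx]


def can_admit(request_size, free_now, horizon, eviction_prob):
    """Admit a spot request only if it fits under the capacity quantile.

    `horizon` plays the role of the user's maximum expected lifetime and
    `eviction_prob` the target eviction probability of the SLA.
    """
    quantile = min_free_capacity_quantile(free_now, horizon, eviction_prob)
    return request_size <= quantile
```

For example, `can_admit(4, free_now=20, horizon=30, eviction_prob=0.05)` asks whether a 4-unit instance with an expected lifetime of 30 steps can be provisioned while keeping the simulated eviction probability at or below 5%. The guarantee holds only under the assumed capacity model.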
Related Resources
Providing Lifetime Service-Level-Agreements for Cloud Spot Instances
Spot instances are commonly offered by IaaS cloud providers to opportunistically utilize spare capacity and meet temporary user demand for additional resources. Although the availability of service SLAs is a core paradigm of cloud computing, spot instances in practice still come without any service quality guarantees. We aim to extend the spot instance service to provide a probabilistic SLA for...
Contrail Virtual Execution Platform Challenges in Being Part of a Cloud Federation
Cloud computing is quickly defining the computing paradigm in the modern networked age. Users can run their large computations online using cloud services at a fraction of the cost compared to setting their own data centers. Clearly cloud computing offers many advantages, and yet many large organizations including governments, financial sector, and health care sector are reluctant in transition...
Contrail Virtual Execution Platform Challenges in Being Part of a Cloud Federation - (Invited Paper)
Cloud computing is quickly defining the computing paradigm in the modern networked age. Users can run their large computations online using cloud services at a fraction of the cost compared to setting their own data centers. Clearly cloud computing offers many advantages, and yet many large organizations including governments, financial sector, and health care sector are reluctant in transition...
CAP: A Cloud Auto-Provisioning Framework for Parallel Processing Using On-demand and Spot Instances
Cloud computing has drawn increasing attention from the scientific computing community due to its ease of use, elasticity, and relatively low cost. Because a high-performance computing (HPC) application is usually resource demanding, without careful planning, it can incur a high monetary expense even in Cloud. We design a tool called CAP (Cloud AutoProvisioning framework for Parallel Processing...
Autonomie, sécurité et QoS de bout en bout dans un environnement de Cloud Computing. (Security, QoS and self-management within an end-to-end Cloud Computing environment)
Today, Cloud Networking is one of the recent research areas within the Cloud Computing research communities. The main challenges of Cloud Networking concern Quality of Service (QoS) and security guarantee as well as its management in conformance with a corresponding Service Level Agreement (SLA). In this thesis, we propose a framework for resource allocation according to an end-to-end SLA estab...